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PATENT 



REMARKS 

Status of the claims 

With entry of the instant amendment, claims 1-6 have been amended. The 
amendments add no new matter and are supported throughout the application and claims as filed. 

Claims 1-32 are pending in the application. Claims 1 1-24, 31, and 32 have been 
withdrawn by the Examiner. Claims 1-10 and 25-30 are therefore under examination. 

Objection to the specification 

The disclosure was objected to for informalities regarding the sequences shown in 
Figures 5 and 6. Applicants note that the sequence listing filed July 2, 2004 included SEQ ID 
NOs for the sequences depicted in Figures 5 and 6. The amendments to the specification 
presented in this paper provide the SEQ ED NOs. Applicants therefore respectfully request 
withdrawal of the objection. 

Restriction requirement 

The Examiner has made the restriction requirement final. Although Applicants 
disagree with the restriction for reasons of record, claim 1 has been amended to delete the 
alternative recitation that the polymerase has at least 80% identity to SEQ ID NO:25. 

With regard to the Examiner's contention that claim 1 is not a 'genus type 1 claim 
(page 3 of the Office Action), Applicants disagree. Claim 1, both before and after amendment, 
encompasses the species set forth in claim 7 as well as other species that possess the claimed 
characteristics. Accordingly, it is properly a generic claim. Applicants also note that claim 1 as 
it relates to the elected reference sequence SEQ ID NO:24 is under examination as a generic 
claim (see, the enablement rejection outlined by the Examiner). Thus, the Examiner's contention 
that restriction of this claim would still allow the claim to be fully considered on its merits (page 
3) appears to relate to the restriction as applied to the reference sequences SEQ ID NO:24 and 
SEQ ID NO:25. If this interpretation is incorrect, Applicants respectfully request clarification. 
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With regard to the restriction applied to SEQ ID NO. 2, the polymerase of SEQ 
ID NO:2 corresponds to the polymerase region of SEQ ID NO:4. Specifically, SEQ ID NO:4 is 
a fusion polymerase in which the polymerase domain (SEQ ID NO:2) is fused to Sso7d (see, 
e.g., the last two sentences of paragraph 155, the descriptions of SEQ ID NOs 1-4, and the 
sequences of SEQ ID NOs 2 and 4). The restriction of SEQ ID NO:2 and SEQ ID NO:4 into 
separate groups is therefore additionally improper. 

The invention 

The current claims relates to hybrid polymerases that have residues from more 
than one parental polymerase protein. Such polymerases are conveniently generated, e.g., by 
targeting amino acid residues that are not conserved in the parental polymerases. Thus, such a 
method can involve aligning the parental polymerase amino acid sequences, and at positions that 
are not identical, introducing either one or the other parental residue that occurs at that position 
into the polymerase. 

Rejection under 35 U.S.C. § 11 2, first paragraph— enablement 

Claims 1-6, 8, 9, 10, 25, and 27-30 were rejected as not properly enabled by the 
specification. The Examiner alleges that it would require undue experimentation to determine 
polymerases that fall within the scope of the claims. The Examiner first contends that one of 
skill in the art would not be able to determine mutations to be introduced into a polymerase that 
would not adversely affect function. Specifically, the Examiner alleges that the specification 
provides no guidance as to what residues can be changed. The Examiner also contends that it is 
not routine in the art to screen for multiple substitutions and that one of skill in the art would 
expect that any tolerance to the modification of a given protein would diminish with multiple 
substitutions. Applicants respectfully traverse this rejection. 

First, the application in fact provides guidance as to which residues can be 
substituted in the claimed polymerase sequences. Furthermore, the hybrid polymerases 
exemplified in the specification demonstrate that more than one substitution can readily occur in 
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a parental polymerase sequence, e.g., SEQ ED NO:24, without rendering the polymerase non- 
functional. 

In addition, this is a highly advanced art. Extensive structure-functional analyses 
have been performed on archaeal polymerases. Such information is readily available to one of 
skill in the art to further determine residues that can reasonably be expected to tolerate 
substitution. 

Thus, as explained below, the examples detailed in the specification, the guidance 
provided by Applicants for selecting residues for substitution, and the level of knowledge in this 
advanced art allow one of skill to identify members of the claimed genus of polymerase proteins 
without undue experimentation. 

The application provides guidance 

The Examiner alleges that the specification provides no guidance as to sites 
within the polymerase protein reference sequence(s) that tolerate substitutions. However, this 
assertion is incorrect. The genus of proteins set forth in claim 1 is derived from two parental 
polymerases, Pfu and Deep Vent®. The reference polymerase sequence SEQ ID NO:24 is the 
sequence of Pfu polymerase. A member of the claimed genus (claim 1) is therefore 
characterized as follows: (i) it has the specified identity to Pfu polymerase; (ii) it comprises the 
SEQ ID NO:23 signature sequence, which is the nucleotide binding site; and (iii) it comprises at 
least one substituted residue at a site that is not conserved between Pfu and Deep Vent® where 
the substituted residue is from the Deep Vent® parent polymerase sequence. 

In teaching how to make members of this genus, the specification provides an 
alignment of the Pfu and Deep Vent® sequences (Figure 1) where "X" indicates positions at 
which the sequences vary. The two polymerases differ from one another at 1 15 locations and 
have 85% identity over their length {see, e.g., paragraph 144). The specification teaches that 
hybrid polymerases can be generated based on alternatively incorporating a residue from either 
one or the other parent polymerase at sites of variation in the sequence(e.£., paragraphs 145- 
148). The specification also teaches that conservative substitutions can be introduced into amino 
acid sequences that are reasonably expected to preserve function (see, e.g., paragraphs 59, 60, 
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and lines 3-13 on page 15). Thus, the specification provides extensive direction to the artisan as 
to the type and placement of substitutions in a polymerase sequence. 

The specification provides examples 

In addition to the guidance provided in the specification at the passages noted 
above, Applicants have taught specific examples of active polymerase proteins that have been 
obtained by following the teachings in the specification. For example, the Examples section at 
pages 39 and 40 teaches at least six hybrid polymerase proteins that were isolated from libraries 
constructed to provide hybrid polymerases having the characteristics set forth in claim 1 
(paragraphs 145-150). Four full-length clones were assembled from one set of libraries 
(designated "Hybl to Hyb4"); two full-length clones ("Phyl" and "Phy2") were assembled from 
a second collection of libraries. These six polymerases were expressed and purified. All of these 
six hybrid polymerase proteins had DNA polymerase activity (paragraph 154). 

Five additional hybrid polymerase clones were isolated from the second library as 
Sso7d fusions and were designated PhS3 to PhS7. The polymerases were tested for polymerase 
and exonuclease activity (paragraph 159). Table 1 provides a summary of characteristic of PhS3 
to pHS7 and other hybrid polymerases exemplified in the specification. Table 1 shows that the 
polymerases indicated as PhSl, PhS2, PhS5, Ph7, Hybl, Hyb2, Hyb3, and HyS4 (SEQ ID NOs. 
4, 6, 8, 10, 12, 16, 18, and 20, respectively) all exhibit activity. HySl (SEQ ID NO:14), which is 
also active, is a Sso7d fusion protein comprising the polymerase Hybl (SEQ ID NO: 12) {see, 
e.g., Table 1). PhSl (SEQ ID NO:4) is an Sso7d polymerase fusion comprising a polymerase 
having the sequence of Phyl polymerase SEQ ID NO:2. 

In summary, the parental polymerase proteins, Deep Vent® and Pfu polymerase 
share at least 85% identity over 700 contiguous amino acids of SEQ ID NO:24. The 
specification further teaches that residues, e.g., residues that are not conserved in the two parent 
polymerases, can be altered to incorporate a residue from a different polymerase to derive hybrid 
polymerases. The claimed polymerase proteins also have the nucleotide binding domain site 
sequence set forth in SEQ ID NO:23. Applicants have exemplified at least eight active distinct 
polymerase sequences that were obtained from libraries designed to provide polymerases having 
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characteristics recited in claim 1. For example, PhSl has 60 residues that are altered from the 
Pfu parent reference sequence to the Deep Vent® parent residue (see, sequences in Figure 5 for 
comparison). Other hybrid polymerases, e.g., PhS2, have additional sites that are varied with 
regard to the parent Pfu sequence that are not altered in PhSl. For example, the residues at 
positions 28 and 30, among others, are unchanged in PhSl relative to Pfu, but are changed in 
PhS2. The disclosure therefore in fact provides considerable guidance to one of skill as to which 
residues of the Pfu reference sequence can tolerate substitution. In view of the foregoing, one of 
skill in the art has a reasonable expectation that hybrid polymerase proteins as claimed that retain 
activity, e.g., either polymerase activity and/or exonuclease activity, can readily be generated in 
accordance with Applicants* teachings. 

The prior art provides detailed structural and functional analyses of polymerase B proteins 

In addition to the specific guidance and examples provided in the specification, it 
is noted that this is an advanced art. The structure of Archaeal family B polymerases is known. 
For example, Hopfiier et al, Proc. Natl. Acad. Sci USA 96:2600-2605, 1999 ("Hopfiier", 
attached as Exhibit A) provides a crystal structure of an Archaeal DNA polymerase. Hopfher 
teaches that the structure of the Thermococcus gorgonarius (Tgo) polymerase allows the 
generation of a structure based sequence alignment of the archaeal subfamily of type B DNA 
polymerases, including Pfu polymerase (p. 3603, column 2, second full paragraph and Figure 3). 
The authors additionally explain that the location of the polymerase active site and the residues 
at that site are maintained in the archaeal family (see, e.g., p. 3603, second column, second to 
fifth full paragraphs). Furthermore, the active site for the 3 1 to 5' exonuclease was also 
characterized (p. 3603, second column, sixth full paragraph). Hopfiier thus demonstrates that a 
considerable body of knowledge relating to the structure of archaeal polymerases was available 
in the art at the time of Applicants' invention. Accordingly, one of skill could also consider this 
structural information to identify or generate members of the claimed genus that can reasonably 
be expected to have activity. 
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The specification teaches one of skill how to practice the claimed invention 

As long as the specification discloses at least one method for making and using 
the claimed invention that bears a reasonable correlation to the entire scope of the claim, then 
the enablement requirement is satisfied (MPEP § 2164.02(b), emphasis added). Applicants 1 
specification meets this requirement. 

Applicants have provided a rational scheme for producing the claimed 
polymerases, as evidenced by the exemplification of multiple functional hybrid polymerases 
obtained following this scheme. Applicants have provided guidance as to which residues in the 
parental polymerase proteins can be successfully substituted. Moreover, the advanced state of 
the art provides the practitioner with additional information regarding polymerase structure and 
function to further identify regions of the parental polymerase that can be modified with a 
reasonable expectation of retaining the biological function. 

As the Examiner knows, it is not the quantity of experimentation that is required 
to practice the invention that determines whether the disclosure properly enables the claims, it is 
whether such experimentation is reasonable. It is well-settled in the biotechnology art that 
routine screening of even large numbers of samples is not undue experimentation when a 
probability of success exists. In re Wands, 858 F.2d 731, 8 USPQ2d 1400 (Fed. Cir. 1988). As 
stated in Wands, "enablement is not precluded by the necessity for some experimentation, such 
as routine screening." In re Wands, at 1404. The fact that experimentation may be complex 
does not render it undue. Applicants have demonstrated that the teachings in the specification, 
particularly in view of the high level of skill of the ordinary practitioner in this advanced art, 
provide adequate guidance such that one of skill in the art could practice the invention with a 
reasonable expectation of success. 

In view of the foregoing, the disclosure properly enables the invention over the 
scope of the current claims. Applicants therefore respectfully request withdrawal of the 
rejection. 
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CONCLUSION 

Applicants believe all claims now pending in this Application are in condition for 
allowance. The issuance of a formal Notice of Allowance at an early date is respectfully 
requested. 

If the Examiner believes a telephone conference would expedite prosecution of 
this application, please telephone the undersigned at 415-576-0200. 




TOWNSEND and TOWNSEND and CREW LLP 
Two Embarcadero Center, Eighth Floor 
San Francisco, California 941 1 1-3834 
Tel: 415-576-0200 
Fax: 415-576-0300 
JML:jml 

60941022 v1 
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Crystal structure of a thermostable type B DNA polymerase from 
Thermococcus gorgonarius 

(x-ray structure/disulfide bonds/replication/Archaca/exonudease) 
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ABSTRACT Most known archaeal DNA polymerases be- 
long to the type B family, which also includes the DNA 
replication polymerases of eukaryotes, but maintain high 
fidelity at extreme conditions. We describe here the 2.5 A 
resolution crystal structure of a DNA polymerase from the 
Archaea Thermococcus gorgonarius and identify structural 
features of the fold and the active site that are likely respon- 
sible for its thermostable function. Comparison with the 
mesophilic B type DNA polymerase gp43 of the bacteriophage 
RB69 highlights thermophilic adaptations, which include the 
presence of two disulfide bonds and an enhanced electrostatic 
complementarity at the DNA-protein interface. In contrast to 
gp43, several loops in the exonuclease and thumb domains are 
more closely packed; this apparently blocks primer binding to 
the exonuclease active site. A physiological role of this 
"closed" conformation is unknown but may represent a poly- 
merase mode, in contrast to an editing mode with an open 
exonuclease site. This archaeal B DNA polymerase structure 
provides a starting point for structure-based design of poly- 
merases or ligands with applications in biotechnology and the 
development of antiviral or anticancer agents. 



Propagation of cells requires faithful DNA replication. This is 
performed in vivo by DNA polymerases (pols), which attach 
the appropriate dNTP to the nascent DNA primer strand to 
match its paired template. Different families of pols are 
involved in different DNA polymerization processes including 
not only DNA replication (1,2) but also repair and recombi- 
nation (3, 4), a heterogeneity also reflected by varying 
polypeptide structures and/or subunit compositions (3, 5). 
Some pols complement polymerase activity with 3' -►5' 
exonuclease activity (editing activity) and/or 5' — > 3' "struc- 
ture-specific endonuclease" activity, often located in separate 
structural domains on the same polypeptide chain (4-8). 

Crystal structures are available for most known polymerase 
families, including the A family DNA polymerases (9-14), pol 
j3 (15-17), HIV reverse transcriptase (18-20), and recently, 
the B family pol gp43 from bacteriophage RB69 (21). All share 
a functional polymerase structure, which resembles a right 
hand built by the palm, fingers and thumb domains (see ref. 7 
for review). Although the fingers and thumb domains are 
highly diverse among the different families, the palm domains, 
which contain the conserved catalytic aspartate residues, show 
a similar topology among all families except pol 0. The 
polymerase nucleotidyl transfer was studied in detail for the A 
family polymerases, HIV reverse transcriptase, and pol j3, and 
was shown to involve two metal ions (summarized in ref. 7). 

Considerably less is known for the family of type B pols, 
which are replicative enzymes in eukaryotes and most likely 
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also Archaea (22, 23). The structure of gp43 from bacterio- 
phage RB69 (21) provided an excellent first insight into this 
family. In addition to the three polymerase domains, gp43 
contains an 3' -> 5' exonuclease domain and an N-terminal 
domain. The exonuclease and palm domains share the topol- 
ogy and active site of A family enzymes, implying similar 
metal-assisted mechanisms for polymerase and exonuclease 
activities (21). The thumb and finger domains are apparently 
unrelated to the other polymerase families. The function of the 
N-terminal domain remains unknown, but may help assemble 
the multicomponent replication apparatus (21). 

Much is known about the replication of phages (24-26), 
viruses (1, 27), Prokaryota, (28) and Eukaryota (1, 3, 29, 30), 
which in general involves pols but also primases, helicases, 
RNaseH, sliding clamps, and other factors (31). Considerably 
less is known for archaeal replication, where mostly B type 
polymerases, similar to eukaryotic replication enzymes pol a 
and 8, have been identified (6, 22, 23, 32-34). This relative 
ignorance is surprising, because such crucial biotechnological 
applications as cloning and PCR require the thermostability 
and fidelity typical of archaeal polymerases (6). Thus, in 
addition to satisfying basic research interests, structural infor- 
mation could assist, for example, the engineering of variant 
enzymes with tailored nucleotide incorporation rates or the 
design of antiviral and anticancer polymerase inhibitors. For 
these reasons, we have determined the structure of a DNA 
polymerase from Thermococcus gorgonarius (Tgo), an ex- 
tremely thermophilic sulfur-metabolizing archaeon isolated 
from a geothermal vent in New Zealand (35). This enzyme 
possesses pol and a 3' — > 5' exonuclease activity, which 
together ensure thermostable replication with high fidelity 
(error rate: 3.3-2.2 X 10~ 6 ; see ref. 36). The 2.5 A structure 
shows a topological similarity to gp43 and gives insight in the 
structural biology of archaeal DNA polymerases, including the 
identification of several mechanisms for thermophilic adapta- 
'tion. 

MATERIALS AND METHODS 

Materials. All materials were of the highest grade commer- 
cially available. 

Bacterial Strains. Escherichia coli LE392 containing 
pUBS520 was used as described (36). E. coli B834 (DE3) (hsd 
metB) was a generous gift of Nediljko Budisa (Max-Planck- 
Institut). 

Expression Vectors. PBTac2 was obtained from Roche 
Molecular Biochemicals. 



Abbreviations: Tgo, Thermococcus gorgonarius; pol, DNA polymerase. 
Data deposition: The atomic coordinates have been deposited in the 
Protein Data Bank, Biology Department, Brookhaven National Lab- 
oratory, Upton, NY, 11973 (PDB ID code 1TGO). 
tPresent address: The Scripps Research Institute, La Jolla, CA 92037. 
*To whom reprint requests should be addressed, email: hopfner@ 
scripps.edu. 
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Heavy-atom derivatives were prepared by soaking the crystals in low-salt buffer containing the heavy atom as follows: U, 0.5 mM uranyl acatate 
for 2 h; PT1, 5 mM K 2 PtCl 4 for 1 d, PT2, 5 mM K 2 PtCl 6 for 2 d; PT3, saturated cw-dichlorodipyridine-Pt(II) for 2 d; PT4, 5 mM K 2 PtCI 6 for 7 
d; PT5, saturated cw-dichlorodipyridine-Pt(II) for 7 d; PTU, PT1 + U; PB, saturated dinitrophenyl-Pb(N0 3 >2 for 7 d; PBPT, PB -f PT5; OS, 
saturated K 2 0s0 4 for 7 d. NAT1, PT1, PT2, PT3, PTU, PB and PBPT were collected with a Mar imaging plate and U was collected with a Bruker 
AXS area detector on a Rigaku rotating anode source. All other data sets were collected with a Mar charge-coupled device at beamline BW6 at 
DESY, Hamburg. 



Cloning, Expression, and Protein Purification. The gene for 
the 89.8-kDa DNA-dependent pol (Tgo pol) was cloned from 
Tgo (Deutsche Samrnlung von Mikroorganismen no. 8976) and 
expressed in E. coli LE392pUBS520 pBtac27go (Deutsche 
Sammlung von Mikroorganismen no. 11328) as described (36). 
Tgo pol was purified essentially as described (36) with the 
substitution of the TSK butyl Toyopearl column by Blue- 
Trisacryl M (Serva) and with an additional concentration step 
on Poros 50 HQ anion exchange medium (Roche Molecular 
Biochemicals). Active fractions were combined, concentrated 
to 30 mg/ml, and transferred to 20 mM sodium phosphate, pH 
8.2/10 mM 2-mercaptoethanol/500 mM NaCl. 

The gene for a selenomethionine-containing variant of Tgo 
pol (Se-Tgo pol) was expressed in E. coli B842 (DE3) (hsd 
metB) using a published protocol (37). Sc-Tgo pol was purified 
by using the wild-type protocol. 

Crystallization. Crystals of purified Tgo pol (or Se-7go pol) 
were grown by using sitting-drop vapor-diffusion technique at 
4°C with high-salt conditions (2:2 /xl protein: reservoir — 100 
mM Tris, pH 8/2.0M ammonium sulfate) and diffracted to 3.0 
A (in-house) and to 2.5 A [beamline BW6 at Deutsches 
Elektronen Synchrotron (DESY), Hamburg]. Low-salt condi- 
tions (100 mM Tris, pH 7.0/200 mM ammonium sulfate/30% 
PEG 400) yielded only poorly diffracting crystals but allowed 
soaks (including heavy atoms) with some cell constant mod- 
ulation (a, b, c = 63.6, 105.0, 160.5) but minimal loss of 
resolution. 

Data Collection and Processing. Data were collected with a 
MAR imaging plate or a Bruker AXS X1000 mounted on a 
Rigaku rotating anode source, or with a MAR imaging plate 
or a MAR CCD (charge-coupled device) at beamline BW6 at 
DESY, Hamburg. The data were processed with saint (Bruker 
AXS), MOSFLM (Mar CCD; ref 38), or denzo (MAR imaging 
plate; ref 39), scaled with scala (40) or scalepack (39), and 
reduced with truncate (40). 

Structure Determination. The structure was solved by mul- 
tiple isomorphous replacement and anomalous scattering 
(MIRAS) by using data from crystals transferred to low-salt 
conditions (Table 1). Crystallographic calculations were done 
with programs from the CCP4 suite (40). Heavy atom positions 
of major sites were located in difference Patterson maps and 
were refined with mlphare (40) to calculate protein phase 
angles to 3.5 A resolution. A partial polyalanine model was 
built into interpretable portions of secondary structural ele- 
ments of the miras map by using main (41). The quality of the 
electron density was improved by phase combination of the 
partial model with the experimental phases by using sigmaa 



(40), and several cycles of solvent flattening to 3.0 A by using 
SOLOMON (40). At this stage, no interpretable density was 
found for a significant portion of the molecule, comprising 
residues 147-154, 283-306, 653-728, and 752-773. 

Model Building and Refinement. The partial model (R 
factor 35%) was used to phase the 2.5 A resolution data of the 
Se-Tgo pol (high-salt conditions). The model was oriented with 
amore (40). The correlation coefficient of 22.0% and the R 
factor of 50.3% showed divergence of the high- and low-salt 
structures. After bulk solvent correction, anisotropic B factor 
correction and rigid-body minimization (treating five domains 
independently), the partial model was iteratively refined and 
extended with simulated annealing, Powell minimization, re- 
strained individual B factor refinement with cns (42), and 
manual model building with main (41) by using data from 
25.0-2.5 A resolution (Table 2, Fig. 1). 

RESULTS AND DISCUSSION 

Structure of Tgo pol. Tgo pol is a ring shaped molecule with 
dimensions 50 A x 80 A X 100 A. The single polypeptide chain 
of 773 aa is folded into five distinct structural domains (Fig. 2): 
the N-terminal domain (residues 1-130), the 3' -» 5' exonu- 
clease domain (131-326), the palm (369-449 and 500-585), 
fingers (450-499), and thumb (586-773) domains of the 
polymerase unit, and a helical interdomain insertion (327-368) 

Table 2. Crystallographic refinement, high-salt form 
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Fig. 1. Stereorepresentation of the electron-density map. The 2 (F Q - F c ) electron density contoured at 1.0 a at the polymerase active site is 
well defined for the refined model (stick representation). 



between the exonuclease and palm domains. The polymerase 
unit forms the DNA-binding crevice, reminiscent of a right 
hand, which is the identifying characteristic of pols. Gp43 from 
bacteriophage RB69 also shows this overall domain topology 
(21). 

Three clefts extend radially from the polymerase active site 
at the center of the ring: two of them in opposite directions, 
forming a large cleft across the molecule, and one perpendic- 
ular to these. Based on active-site homology to pol A family 
enzymes, the two opposite clefts probably bind duplex DNA 
(cleft D, according to ref. 21) and single-strand template DNA 
(cleft T), respectively. The perpendicular (editing) cleft links 



the polymerase active site and the exonuclease active site and 
binds the primer strand in editing mode (21). 

The exonuclease domain is structurally equivalent to the 3' 
— > 5' exonuclease domain of pol A family (43). Like gp43, 
however, it is bound at the opposite side of the polymerase unit 
by noncovalent contacts to the thumb domain at the editing 
cleft, on one side, and by covalent and noncovalent contacts to 
the N-terminal and palm domains and the 42 residue inter- 
domain helix, on the other side. This latter segment is located 
at the base of cleft T, which is additionally bounded by the 
exonuclease, N-terminal, and palm domains. 

The topology of the palm domain is conserved among 
polymerase families (5), with two long helices (Q and R) 



Tgo DNA polymerase 
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Fig. 2. Structure of Tgo pol and comparison with gp43 form bacteriophage RB69. (Left and middle) Ribbon representation of Tgo DNA 
polymerase (Upper) in two orthogonal views with labeled secondary structure elements. The molecule is composed of five domains: N-terminal 
domain (yellow), 3' — >5' exonuclease (red), palm [light and dark magenta represent the N-terminal and C-termina! segment, respectively (see text)], 
fingers (blue), and thumb (green), which are arranged to form a ring. An interdomain helical segment between the exonuclease domain and the 
palm is orange. The conserved carboxylates in the active site and the two disulfide bridges are shown as magenta and yellow ball-and-sticks, 
respectively. Tgo pol has the same overall architecture and domain topology than gp43 of RB69 (Lower). The 50-residue insertion in the fingers 
of gp43 is gray. (Right) Comparison of molecular surfaces of Tgo pol (Upper) and RB69 gp43 (Lower). Red and blue denote negative and positive 
electrostatic surface potentials, respectively. In contrast to gp43, Tgo pol has a strongly enhanced positive potential at the putative DNA-binding 
clefts. 
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Fig. 3. Sequence alignment of B family DNA polymerases. The 
alignment has been adapted from ref. 21 to highlight specific residues 
from the class of archaeal pols. The secondary structure of Tgo pol is 
indicated on top of the alignment with helices (bars), strands (arrows) 
and loops (lines) colored according to domains with the same color 
code as Fig. 2B. Strictly conserved residues of type B polymerases are 
red, and additional conserved residues are yellow. Uniquely conserved 
residues of archaeal type B enzymes — as discussed in the text — are 
green. Disulfide bonds are shown by a bar on top of the alignment. 
Abbreviations: tgo, Thermococcus gorgonarius pol; pfu, Pyrococcus 
furiosus pol; tsp, Thermococcus sp pol; tli, Thermococcus litoralis pol; 
mvo, Methanococcus voltae pol; RB69, bacteriophage RB69 pol; T4, 
bacteriophage T4 pol; Eco\ E. coli pol II; pol S; human pol S. 

packed against the five-stranded antiparallel /3-sheet that 
contains the three conserved aspartate residues involved in 
nucleotidyl transfer. The fingers emerge from the palm domain 
as an a-helix-rich insertion. Its 50 residues are folded into two 
antiparallel coiled a-helices of approximately equal size: helix 
P contains the conserved Kx 3 NsxYGx 2 G motif of B type 
polymerases and is related to the O helix of A type enzymes 
(see below). The ^50-residue insertion between helix O and P 
in RB69 and T4 gp43 is missing in Tgo pol, where both helices 
and 4 residue linker are much shorter than their equivalents in 
gp43. The shorter fingers of Tgo pol presumably reflect the 
typical structure of the nonbacteriophage B type fingers (pol 
a, pol 5, and E. coli pol II). The thumb domain topology, 
similar to that of gp69, is unrelated to other polymerase types. 
However, like the thumb of A type enzymes, a bundle of 
a-helices at its base protrude from the active site /3-sheet. 
Distal to the active site, the thumb contains a 75-residue 
subdomain (665-729), which fixes the exonuclease domain and 



contributes to the editing channel, explaining why mutations in 
the exonuclease domain of B-type polymerases affect the 
polymerase activity and vice versa (44; 45). 

Weakly defined density across the base of the thumb domain 
was modeled as the C-terminal 6 residues with a polyalanine 
chain. The C terminus thus does not protrude from the core 
molecule as in the RB69 polymerase (21). Because the C 
terminus of the T4 pol are involved in sliding-clamp binding 

(46) , it is likely, however, that these residues become ordered 
on any similar holoenzyme formation. 

Sequence Alignment of Archaeal DNA Polymerases. The 
structure of Tgo pol allows the generation of a structure based 
sequence alignment of the archaeal subfamily of type B DNA 
polymerases, the location of conserved and unique residues, 
and the comparison with other type B DNA polymerases (Fig. 
3). 

Polymerase Active Site. The polymerase active site is formed 
by the central j3-sheet (strands 16, 17, 20, 21, and 22) and helix 
N of the palm domain and helix P located in the fingers and is 
highly conserved among B family polymerases (Fig. 4). Three 
carboxylates required for nucleotidyl transfer in B family 
polymerases, two of which coordinate two metal ions (14) are 
superimposably conserved among A family enzymes, B family 
enzymes, and reverse transcriptase (21). Superposition of Tgo 
pol and T7 replication complex (14) places the dNTP near the 
proposed nucleotide-binding site in helix P, the 
K487x3NSxYGx2G motif (Fig. 3) and suggests interactions of 
the carboxylates with metals and the phosphate tail of the 
bound dNTP (Fig. 4). Reorientation of the strictly conserved 
Lys-487 allows it to mimic the Lys-522-phosphate tail inter- 
action in T7. Tyr-494 (Kx 3 NSxY494Gx 2 G) and Tyr-409 
(SLY409PSII) form the bottom of the nucleotide-binding site. 

The active site of B family polymerases contains a DTDS 
motif, which, however is DTDG in the archaeal subfamily. In 
Tgo pol, the relatively conserved Tyr-402 from the adjacent 
strand provides an alternate alcohol group, at a position 
appropriate for metal coordination or binding of the 3' end of 
the primer. The orientation of Tyr-402 is stabilized by an 
aromatic cluster that also includes Phe-545 and Tyr-538. 
Archaeal Methanococcus voltae and Thermococcus sp. pol's 
(see Fig. 3) have Tyr at position 545 — Phe in Tgo — rather than 
at 402, but might also supply an alcohol group. The displace- 
ment of a functional alcohol from serine in DTDS to Tyr-402 
or Tyr-545 might stabilize its orientation as an adaptation for 
thermostability. 

The conserved cluster of acidic amino acids (E578, E580) 
form an unexpected metal-binding site for Mn 2+ and Zn 2+ 
(Fig. 4). Its proximity to Asp-404 and to the expected location 
of the dNTP *y-phosphate suggests a supporting role in nucle- 
otide binding and/or catalysis. 

3' — > 5' Exonuclease Active Site. Tgo pol is characterized by 
a strong 3' — > 5' exonuclease activity, unlike eukaryotic B type 
polymerases (unpublished results). The exonuclease active site 
is formed at the interface between the exonuclease domain and 
the tip of the thumb (Fig. 5). All residues required for catalysis 
are located in the exonuclease domain, which, at least for T4 
gp43, retains activity when dissociated from the polymerase 

(47) . However, the thumb domain, with, for example, RB69 
gp43's Phe-123-base intercalation, partially controls the bind- 
ing geometry of single strand DNA (21, 43). 

The exonuclease structures of Tgo and gp43 DNA poly- 
merases are similar at the editing site but differ considerably 
at the exonuclease-thumb interface. Strand 10 contains the 
metal-binding D141IE motif and readily superimposes with 
the equivalent strand from gp43, allowing modeling of a 
single-strand DNA segment into the exonuclease site based on 
theRB69 gp43-p(dT) 4 complexes (21). The conserved residues 
Asp-141 and Glu-143 in the Exo I motif, Tyr-209, Asn-210, 
Phe-214, and Asp-215 in Exo II, and Tyr-311 and Asp-315 in 
Exo III are in approximate DNA-binding conformations (Fig. 
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Fig. 4. Polymerase active site. (A) Stereoribbon representation (using color code as in Fig. 2) with modeled DNA. Active-site residues are shown 
as ball-and-stick representations with carbon (green), nitrogen (blue), and oxygen (red) atoms. The DNA template (light brown), primer (light 
brown), and dNTP (orange) complex has been taken from the coordinates of T7 replication complex (15) by superimposing D404, D542, and 
adjacent residues with corresponding residues in T7 pol (D475 and D654). Phosphorus atoms are yellow. The two metals of the T7 replication 
complex are shown as magenta spheres. (B) Experimentally observed metal-binding site for Mn 2+ (F Q - F c density contoured at 5<r) and Zn 2+ 
in the "low salt" crystal form. The carboxylates E578 and E580 are conserved in type B polymerases (Fig. 3). 



5A). However, the editing cleft is constricted by a displacement 
of the tip of the thumb toward the exonuclease domain to 
prohibit single-strand binding (Fig. 55). This shift is correlated 
with a large change in the loop between strands 10 and 11. In 
RB69 gp43 (and likewise in T4 gp43), this loop forms a lid over 
the 3' base and contains Phe-123, which intercalates between 
the first two bases. In Tgo pol, the loop is curved outward, away 
from the thumb, and Phe-152 (the equivalent of gp43's Phe- 
123) attaches to Phe-214 10 A away from intercalation site. 
This shift allows the tip of the thumb to move into the editing 
channel and to block the exonuclease site. 

Are There Different Conformations in Polymerase and 
Editing Mode? If a closed conformation of the exonuclease 
domain prohibits single strand binding, an open conformation 
is required for editing. The observed closed conformation may 
represents the enzyme in "polymerase" mode. Preliminary 
analysis of the crystal structure of Tgo pol in the low-salt 
conditions indicates a structural change at the interface of 
exonuclease and thumb, possibly reflecting a transition be- 



tween open and closed forms. The closed conformation ob- 
served here may, however, be a nonphysiological artifact of the 
high ionic strength used for crystallization. Crystal structures 
of the enzyme in both polymerase and editing modes are 
required. 

Adaptation to High Temperatures. Tgo is a sulfur- 
metabolizing, extremely thermophilic archaeon, with a growth 
range between 55°C and 98°C For accurate replication at this 
temperature range, the polymerase must not only be stable, but 
must also adequately bind substrate DNA. A comparison with 
gp43 from the mesophilic bacteriophage RB69 indicates sev- 
eral such adaptations to high temperatures. Several loops are 
shorter in Tgo pol than in gp43 (Fig. 2), and there is an increase 
in hydrogen bonded j3-strand content: Tgo pol secondary 
structure includes 41% helix, 22% j3-strands, and 19% turns 
(calculated according to ref. 48), whereas gp43 has 42% helix, 
17% /3-strands, and 19% turns. 

Although rare among cytoplasmic or nuclear proteins, two 
disulfide bridges might be formed: cysteine pairs 428-442 and 




Fig. 5. 3' -> 5' exonuclease active site. (A) Stereoribbon representation with modeled DNA using the color code of Fig. 4 A (ball-and-stick) 
and Fig. 2 (ribbons). Active-site residues are shown as ball-and-stick representation. The single-stranded DNA has been taken from the coordinates 
of RB69-single-stranded DNA complex (21). The orientation of the DNA has been obtained by superimposing the DIE143 motif in Tgo pol with 
corresponding motif of RB69 gp43 (DIE116). Strand 27 and its preceding loop from the thumb (green) is apparently in collision with the modeled 
DNA. (B) Comparison of the exonuclease -thumb interface between Tgo pol (color code of Fig. 2B) and RB69 gp43 (gray). In Tgo pol, the lid of 
the editing site (red) is bent outward compared with the equivalent loop of gp43 (yellow), allowing the tip of the thumb to move several A (\) 
closer to the exonuclease. This conformation is incompatible with formation of an editing complex [the p(dT) 4 of gp43 is shown as brown space-filling 
model]. 
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506-509, although reduced, are poised for attachment (Fig. 2). 
Model refinement and electron density inspection with and 
without constraints for the disulfide bonds verified the reduced 
state (observed unrestrained SG-SG distance: 2.8 A and 3.0 
A). This is consistent with our E. coli expression and further 
rules out structural perturbation by nonnative oxidation. These 
cysteines are located in the palm domain and are conserved 
among B type enzymes from hyperthermophilic sulfur- 
metabolizing archaeons, but not among mesophile homologs 
(Fig. 3). The Cys-428-Cys-442 bridge stabilizes the compact 
fold of the loop segment between helix N in the palm domain 
and helix O in the fingers and presumably also the relative 
orientation of these helices. In addition, the loop segment 
packs against helix Q in the palm domain. Helix Q, the spine 
of the palm domain, is further stabilized at the first helical turn 
by the second disulfide bridge between Cys-506 and Cys-509. 

A much enhanced complementary positive potential for all 
three DNA-binding clefts of Tgo pol is observed relative to 
gp43 (Fig. 2). Thus, in addition to hydrogen bonding and 
specific DNA-protein interactions, binding to Tgo pol has an 
additional strong stabilizing electrostatic component. 

An increase in the number of salt bridges is often associated 
with thermostability. Although Tgo pol has a greater total 
number of charged residues (262) than gp43 (245), both 
molecules have 54 salt bridges within a 3-5 A bound. However, 
in the 5-7 A range of charge distance, Tgo pol has 77 ion pairs 
compared with 43 for gp43. This large increase results in a 
more highly charged surface of Tgo pol, accompanied by a 
more balanced charge distribution, compared with gp43 where 
charges are often located in patches (Fig. 2). 
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